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ABSTRACT 

In this, the second in a series of three papers concerning the SuperCOSMOS Sky Sur- 
vey, we describe the methods for image detection, parameterisation, classification and 
photometry. We demonstrate the internal and external accuracy of our object param- 
eters. Using examples from the first release of data, the South Galactic Cap survey, 
we show that our image detection completeness is close to 100% to within '--^ 1.5 mag 
of the nominal plate limits. We show that for the Bj survey data, the image classifi- 
cation is externally > 99% reliable to Bj ^ 19.5. Internally, the image classification 
is reliable at a level of > 90% to Bj ~ 21, R ~ 19. The photometric accuracy of our 
data is typically ~ 0.3 mag with respect to external data for m > 15. Internally, the 
relative photometric accuracy in restricted position and magnitude ranges can be as 
accurate as ~ 5% for well exposed stellar images. Colours are externally accurate to 
ctb-r.r-i ~ 0.07 at Bj ~ 16.5 rising to ctb-r.r-i ^ 0.16 at Bj ~ 20. 

Key words: astronomical databases: miscellaneous - catalogues - surveys - methods: 
data analysis - techniques: image processing, photometric 



1 INTRODUCTION 

In Paper I of this series (Hambly et al. 2001a) we de- 
scribe the SuperCOSMOS Sky Survey programme (hereafter 
SSS). This project is to scan the multi-colour/multi-epoch 
Schmidt photographic atlas material to produce a digitised 
survey of the sky in three colours (BRI), one colour (R) 
at two epochs. The ultimate aim of the project is to cover 
the entire sky; the first release of data from the programme 
was the South Galactic Cap (hereafter SGC) survey. The 
SGC survey covers ~ 5000 square degrees of the southern 
sky at Galactic latitudes |6| > 60°. Paper III in this series 
describes the derivation of the astrometric parameters for 
the SSS (Hambly et al. 2001b). In this, the second paper of 
the series, we describe in some detail the image detection, 
parameterisation and classification procedures for the pro- 
gramme. We also describe the techniques for photometric 
cafibration. 

Paper I is intended as a User Guide for the survey 
data. It describes the database organisation and demon- 
strates specific examples of the use of the data. Papers II 
and III provide technical details concerning the derivation 
of object catalogue parameters available from the survey 
database, and also demonstrate the precision of these with 



respect to external data from other sources. In these papers, 
we demonstrate examples and make comparisons using data 
from the SGC survey. All the results, however, are generally 
appficable to the SSS data as a whole at Galactic latitudes 
l&l > 30°. At lower latitudes, image crowding will of course 
degrade astrometric and photometric performance. 

The plate material used in the SSS is detailed in Pa- 
per I, and consists of sky-limited Schmidt photographic 
glass plate and film originals, or glass copies of glass origi- 
nals, taken with the UK, ESO and Palomar Oschin Schmidt 
Telescopes (for more details see Morgan et al. 1992 and ref- 
erences therein). Hereafter, the SERC-J/EJ survey will be 
refered to simply as the J survey; the AAO-R/SERC-ER 
as the R survey and the SERC-I as the I survey. 



2 METHODOLOGY 
2.1 Pixel analysis 

2.1.1 Digitisation 

The SuperCOSMOS machine digitises the scanned photo- 
graph at lOfim resolution (0.67 arcsec) in 15-bit grey lev- 
els (further details of the hardware are given in Hambly et 
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al. 1998). Pixel values are in transmission units and are re- 
lated to incident intensity via the usual transmission (T), 
density (D) and intensity (I) relationships on the the log- 
linear part of the photographic characteristic curve: 



Tlogio / oc D oc logio(l/T), 



(1) 



where 7 is the gradient of the response in the D — logj^g ^ 
plane. Each photograph is digitised in ~ 2 hr, producing 
~ 2 Gbyte of data. The pixel map is 'blocked down' in 64x64 
pixel cells during scanning to produce a raw median sky 
map; all subsequent processing is then carried out offline. 



2.1.2 Sky estimation 

Object detection in the raw pixel map requires an estimate 
of the sky value at every point within the two-dimensional 
image. The raw median values of the sky are first filtered 
using a weighted median filter (Brownrigg 1984) of the form 

1 1 1 
1 3 1 
1 1 1 

applied over filtering scales of 3,2 and 1 blocks (e.g. Sto- 
bie 1986) . This results in variations in scale size greater than 
~ 2 mm (or ^ 2 arcmin) being retained as sky variations 
in the background array. This is obviously a compromise 
between detection of low surface brightness objects at this 
scale or greater and accurate image parameterisation in ar- 
eas of real sky variation. Experiments with differing scale 
lengths showed that the choice of raw blocking of 0.64 mm 
squares coupled with iterative filtering scale sizes of 3,2 and 1 
(e.g. Stobie 1986) is optimal in terms of minimising spuri- 
ous detections due to, for example, low level scattered light 
around bright stars (see also MacGillivray & Stobie 1984). 



2.1.3 Transmission to intensity calibration 

Most modern Schmidt photographs have one or more in- 
tensity calibration 'step-wedges' exposed on their sides or 
corners (see, for example, Tritton 1983). In principle, it is 
possible to measure the transmission of the various intensity 
spots and calibrate measured transmission into incident rel- 
ative intensity. In practice, however, there are a number of 
reasons why even rough step-wedge calibration is difhcult to 
achieve. Variations in emulsion sensitivity across the plate 
and non-uniform illumination of the wedge spots themselves 
can cause large errors. Also, the fact that SuperCOSMOS 
scans by illumination of a strip of emulsion means that when 
imaging dense spots, the light levels recorded are badly af- 
fected by diffracted light originating from relatively clear 
emulsion surrounding the spot. Measurements of the 16- 
spot KPNO step wedges on modern UKSTU plates using a 
Macbeth diffuse densitometer show values of characteristic 
curve gradient 7 ~ 2 ± 10% (indeed, such a value is a crite- 
rion for a survey-grade plate and inclusion in the final UK- 
STU photographic sky atlases). Hence, we use an assumed 
transmission-to-intensity calibration with slope correspond- 
ing to 7 « 2 (older glass copies of the POSS-I 'E' survey 
are analysed using a different value due to their poorer con- 
trast) . This procedu re is vindicated not least by the demon- 
stration, in Section 2.3.1 , of measurements of the gradients 



of the instrumental magnitude scale with respect to CCD 
data, where we show that the resulting scales have close to 
unit gradient with scatter at around the 10% level. 



2.1.^ Thresholding and image parameterisation 

Image detection requires thresholding above a local sky esti- 
mate. The SSS pixel data are thresholded at a constant per- 
centage cut above local sky. A sky noise parameter ctskyq 
is determined from a small (~ 10 x 10 arcmin) region of 
image data near the centre of each plate. The sky back- 
ground value Iskyq in this region is determined as described 
in Beard, MacGillivray & Thanisch (1990) while the ctskyo 
estimator uses the relationship (Jskyq = 1.48 x MADskyq, 
where MADskyq is the median of absolute deviations of the 
pixel values about the sky estimate (the factor 1.48 is the 
scaling needed to convert the MAD estimate for a Gaussian 
distribution to the equivalent Gaussian sigma). This yields 
a 'percentage cut' value PCUT (= 100 x 2.3(tskyo /^skyq) 
such that at any part of the image 

(100 PCUT) ^ 

J THRESH = X -/sky. 



100 



(2) 



where /thresh is the threshold intensity, /sky is the lo- 
cal sky estimate based on bilinear interpolation of the fil- 
tered sky map, and PCUT is typically 7% for survey plates. 
Note that this single-parameter multiplicative thresholding 
is designed to cope with the fact that the sky level (and 
therefore the noise level) changes over the Schmidt field due 
to vignetting, and a zeropoint sky intensity value is chosen 
for the transmission-to-intensity calibration such that the 
threshold, in terms of the number of a above sky, remains 
constant regardless of sky variations. 

Once the thresholded pixel set has been determined for 
a plate, pixel connectivity can be analysed and image param- 
eterisation can take place. The details of the COSMOS pixel 
analyser are contained in MacGillivray & Stobie (1984), Sto- 
bie (1986) and Beard et al. (1990) and will not be repeated 
here; however we describe some details that are relevant to 
SuperCOSMOS survey data. Many of the techniques used 
are similar to those developed for the APM machine (e.g. Ir- 
win 1985; Draper & Eaton 1999). 

For image detection, 8-fold nearest-neighbour pixel 
connectivity is used along with a criterion that images must 
contain at least 8 connected pixels (the so-called 'area cut'). 
Rethresholding at 16 levels defined as in Beard et al. (1990) 
Equation 2, with a 'deblend parameter' of D= 1.05 detects 
multiple objects via fragmentation; in addition rethreshold- 
ing at 8 levels defined by 



/APn ~ /thresh -1-2" X ctskyq = 1,2, ...8 



(3) 



is used to supply a set of 8 'areal profiles' at intensity levels 
of 0.5cr, la, 2a, ia ...above the zeroth areal profile level 
/thresh. These provide a set of measurements of the areal 
extent (in pixels) of each image, an d can be used for mor- 
phological classification (see Section 2.2.3). Note that these 



areal profile levels are different to those originally used for 
the COSMOS machine (MacGillivray & Stobie 1984). This 
new definition provides optimal levels for stellar profile anal- 
ysis (see later). 

Object parameterisation proceeds for each detected ob- 
ject as described in Stobie (1986), with both unit-weighted 
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No. 


Name 


Description 


Units 


1 


RA 


Celestial Right Ascension 


10~* radians 


2 


DEC 


Celestial Declination 


10~® radians 


3 


XMIN 


Left extent 


0.01 micron 


4 


XMAX 


Right extent 


0.01 micron 


5 


YMIN 


Bottom extent 


0.01 micron 


6 


YMAX 


Top extent 


0.01 micron 


7 


AREA 


Total area 


Pixels 


8 


IPEAK 


Peak intensity 


Intensity 


9 


COSMAG 


Isophotal magnitude 


Millimag 


10 


ISKY 


Sky intensity at (XCENJ.YCEN J) 


Intensity 


11 


XCENJ 


Intensity weighted X centroid 


0.01 micron 


12 


YCENJ 


Intensity weighted Y centroid 


0.01 micron 


13 


A.U 


Unweighted semi-major axis 


0.01 micron 


14 


B.U 


Unweighted semi-minor axis 


0.01 micron 


15 


THETA.U 


Unweighted orientation 


Degrees 


16 


AJ 


Weighted semi-major axis 


0.01 micron 


17 


BJ 


Weighted semi-minor axis 


0.01 micron 


18 


THETAJ 


Weighted orientation 


Degrees 


19 


CLASS 


Classification fiag 




20 


P.A 


Celestial position angle 


Degrees 


21 


AP(1) 


Area above areal profile level 1 


Pixels 


22 


AP(2) 


Area above areal profile level 2 


Pixels 


23 


AP(3) 


Area above areal profile level 3 


Pixels 


24 


AP(4) 


Area above areal profile level 4 


Pixels 


25 


AP(5) 


Area above areal profile level 5 


Pixels 


26 


AP(6) 


Area above areal profile level 6 


Pixels 


27 


AP(7) 


Area above areal profile level 7 


Pixels 


28 


AP(8) 


Area above areal profile level 8 


Pixels 


29 


BLEND 


Deblending flag 




30 


QUALITY 


Quality flag 




31 


N(0,1) 


Profile classification statistic 


a X 1000 


32 


PRFMAG 


Profile magnitude 


Millimag 



Comments 

see Section 
see Section 



2.2.1 



2.2.1 



^PEAK _ 
Equation ^ 
^SKY, e.g. Equation 



2.2.3 



see Section 
see Section |2.2.1 
Equation js] 



see text 
see text 
see Section 
see Section 



2.2.3 



2.2.3 



Table 1. The full 32— parameter Image Analysis Mode set for each detected image, along with units for internal storage as 4-byte integers. 



and intensity-weighted parameters being calculated and 
recorded. The object catalogue produced for each plate con- 
sists of an unformatted binary file (the Image Analysis 
Mode, or lAM, file) of record length 128 bytes consisting 
of 32 4-byte parameters per object; depending on Galactic 
latitude and plate depth, this individual plate catalogue can 
have anything from 10^ to 10^ entries. Table |l| details the 
full 32 parameter set available for each image. Note that pa- 
rameters 1,2,19,20,31 and 32 and some bits in parameter 30 
are derived at a second stage of processing (known collo- 
quially as 'post processing' - see Section Ej) Most of the 
parameters in Table |l| are self-explanatory and are derived 
directly from first- and second-order moments as described 
in Stobie (1986). However, the following gives further details 
of some of the more esoteric parameters: 

• COSMAG (parameter 9) is calculated from the sky- 
subtracted intensity values in the connected pixels compris- 
ing an image, 

n 

COSMAG = {-2.5 logio ^(/» - /sky)} x 10^ (4) 

i = l 

and is given in units of millimags. COSMAG is an isophotal 
magnitude. 

• BLEND: each image is deblended by rethresholding at 
the levels defined by Equation 2, Beard et al. (1990); at each 
level, the connected pixels are examined for fragmentation 



and pixels are assigned to fragments using the algorithm 
described therein. Images which have been deblended have 
this indicated in the deblending flag. A code of —n means 
the image is the parent image of n components, while a code 
of +n means that the image is the n*'^ deblended component 
(or child image) of a parent image. Both parents and children 
are retained in the lAM datasets of each plate in the SSS. 

• QUALITY is an integer flag of 32 bits that are used 
to record situations encountered during the analysis of the 
pixels comprising an image. Bits are set in such a way that 
increasingly severe conditions are flagged by setting increas- 
ingly significant bits in the quality flag. Table ^ gives more 
information; other bits in the quality flag are either used 
only for image analysis software debugging or are totally 
unused. 



2.2 Post— processing 

The post-processing of each field consists of a number of 
separate tasks that either overwrite information into the in- 
dividual plate catalogue files, or that write separate files of 
data or coefficients. Paper I describes the SSS database or- 
ganisation and the system that is used to associate individ- 
ual plate files into a seamless whole. Here, we describe the 
post-processing tasks that operate on each plate datafile to 
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Event 



Bit 

set 



Incr- 
ement 



Severity 



Comments 



Orientation calculation failed 
EUipticity calculation failed 
Image too multiple for deblending 
Bright image 

Large image 
Possible bad image 

Image near very bright star 

Image touches boundary 



7 

10 



1 
2 
4 
16 

64 
128 



Information 
Information 
Warning 
Information 

Warning 
Warning 



1024 Warning 



Image perfectly round 

Image perfectly straight 

Image split into too many fragments 

Image has pixels brighter than highest areal profile level 
(as defined by n = 8 in Equation H) 

Image has area greater than maximum specified for deblending 
Image is in region likely to be affected by step wedge or label 
(see Section 2.2.2 ) 

Image may be spu rious due to bright star 
(see Section |2.2.^) 



16 65536 Severe defect Image may be partially missing 



Table 2. lAM parameter 30 (quality flag) details. 



produce astrometric, photometric and image quality/class 
information. 



2.2.1 Astrometry: parameters 1,2 and 20 

Paper III describes the derivation of global astrometric solu- 
tions for each plate. lAM parameters 11,12 and 18 (respec- 
tively X, y and ellipse-fit position angle with respect to tiie 
X axis) are transformed into celestial co-ordinates and a ce- 
lestial position angle which are written to lAM parameters 
1,2 and 20 respectively. Proper motion information is also 
derived between J/R plate pairs and held in separate files, 
as described in Paper III. 

2.2.2 Additional quality information 

Two additional bits in the quality flag that are unused by the 
image analysis software are derived at the post-processing 
stage. Firstly, bit 7 is used to flag images that are in parts 
of the plate that are likely to be affected by the presence 
of step-wedges and labels. Areas of the plate are flagged in 
this way based on the number and the telescope origin of 
the plate. For example, all UK Schmidt plates have a 7-step 
wedge in the west of their northern edge; in addition plate 
numbers 1 through to 5715 have a 7-step wedge on their 
eastern edge; plate numbers 5716 through to 6295 have a 
16-step KPNO wedge in the same eastern position of the 
7-step wedge while all plates numbered from 6296 have a 
16-spot KPNO wedge in their south-eastern corner. The 
survey plate sets used have been examined for wedges and 
labels, and all images in the appropriate areas have bit 7 
set to indicate possible spurious nature or probable poorly 
determined parameters in the case of real images. Because 
the modern survey field centres are spaced to provide ~ 0.5° 
overlap regions between adjacent plates, all parts of the sky 
affected by a wedge or label on one plate will be unaffected 
on one or other of the adjacent plates, so these flags are 
used when creating 'seamless' catalogues across plate over- 
lap boundaries when making a decision as to which image 
to include out of a set of duplicates (see Paper I) . 

Secondly, possible problems with an image due to the 
proximity of a very bright star are indicated using bit 10. 
Tritton (1983) describes in some detail the origin and ap- 
pearance of various image defects around bright stars on 



sky-limited UK Schmidt plates. While the background fol- 
lowing algorithm is tuned to minimse spurious image detec- 
tions around bright stars, there are inevitably many false 
detections due to, for example, diffraction spikes. Within 
the SGC survey, 10 plates from each survey colour were ex- 
amined by eye to measure an exclusion radius, as a func- 
tion of magnitude, around bright stars. In Figure |^ we show 
these measurements and the log-linear fit as a function of 
magnitude that is used to flag images as potentially spu- 
rious. We have been conservative in this procedure in the 
sense that we have flagged all images that could possibly be 
spurious, including of course some real images. Hence, if a 
relatively uncontaminated (but incomplete) catalogue is re- 
quired to be extracted from the survey data, this quality bit 
can be tested. The flagging procedure is completely auto- 
matic: given the fit as a function of magnitude and a bright- 
ness threshold above which to apply it, a list of bright stars 
is extracted from each lAM dataset and a record kept of the 
magnitude and radius for flagging. These lists are recorded 
for each field and can be used for estimating spatial sampling 
completeness when employing this flag. 



2.2.3 Image classification 

The SSS data are image classified using a two-stage classi- 
fication scheme originally developed for APM data (e.g. Ir- 
win & McMahon 1992). The first classification stage makes 
use of conventional techniques such as surface bright- 
ness and image shape (cf. Heydon-Dumbleton, Collins & 
MacGillivray 1989; MacGillivray & Stobie 1984). The sec- 
ond stage uses a series of image parameters to trace the 
stellar locus in a multidimensional parameter space and then 
quantify the similarity, or otherwise, of an individual image 
with respect to stellar objects of the same intensity. Advan- 
tages of this second stage include: the generation of a 'lin- 
earised' instrumental magnitude scale (see later); the ability 
to incorporate prior knowledge of the expected fraction of 
different classes of images; and a stellarness index in addition 
to the more usual discrete classification flag (stellar/non- 
stellar/noise). The stellarness index quantifies how well the 
parameters of a given image match the stellar template for 
that intensity. For ease of use, the index is given in terms 
of the equivalent normalised Gaussian distribution, of zero- 
mean and unit-variance - ie. an N(0,1) statistic. 
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(a) (b) (o) 




Magnitude Magnitude Magnitude 

Figure 1. Exclusion radii as a function of magnitude for (a) SERC-J/EJ; (b) SERC-ER/AAO-R; (c) SERC-I; (d) AAO-R on 4415 
film; (c) ESO-R and (f) POSS-I E surveys. The dashed lines are linear least-squares fits to the data while the solid lines are parallel to 
the fits but offset upwards to provide a conservative exclusion radius. Objects within the exclusion radius of a bright star have bit 10 set 
in their quality flag to indicate possible spurious nature. 



The first stage of image classification consists of gener- 
ating scatter plots using a combination of measures of the 
image magnitudes, surface brightness and peak brightness 
versus magnitude and automatically locating the stellar lo- 
cus in each. The stellar locii are automatically found by con- 
verting each scatter plot into a 128 x 128 two-dimensional 
map, where each "pixel" repesents the number of points ly- 
ing in that bin. Obvious noise-like objects, for example with 
eUipticity >0.9 or with areal profile slopes incompatible with 
real images, are not included in the two-dimensional count 
(but are shown in the the scatter plots). The location of 
the mode, and scatter about the modal value, are then de- 
termined, where possible, for each magnitude/peak slice and 
used to track the stellar locus and spread in stellar locus. For 
faint magntiudes where the stellar locus blends in with the 
remaining object locii, the centre-of-gravity of the generic 
distribution is used instead. 

Figures ^(a) to (c) show typical examples of these scat- 
ter plots, where the solid lines mark the boundaries of 
the stellar/non-stellar locus - defined to be 2a away from 
the mode, where the mode is well measured, or defined as 
the centre of gravity of the distribution for the faint end. 
The isophotal area versus magnitude diagram is addition- 
ally used to define a stellar/noise image boundary which 
is further used to fiag images not to consider in the other 
two plots. After some experimentation, a combination of the 
three scatter plots shown in Figures ^ was found to give a re- 
liable stage-1 classification over a wide range of magnitudes. 
Galactic location and plate/emulsion combinations. 

The final stage 1 classification is based on a combination 
of the signed sigma-normalised distances from the stellar 



locii from these three plots plus the image eUipticity infor- 
mation. As noted previously, at this stage the stellar/non- 
stellar boundary is set to conservatively class objects as stel- 
lar by defining a 2a boundary from the median stellar locus. 
In a final step, obvious non-stellar objects in the stellar bin 
are rebadged by checking for consistency of other parameters 
such as eUipticity and areal profile slope. 

At the completion of the first stage of image classifica- 
tion, each image has lAM parameter 19 overwritten by a 
classification code in the range 1 to 4 where l=non-stellar, 
2=stellar, 3=unclassifiable and 4=noise (i.e. images lying 
above the upper locus in Figure ^.) 

The provisional classification from stage 1 is then used 
as the primary input for the stage 2 classification scheme 
outlined earlier. This second phase builds upon the classi- 
fication already developed by using it as the basis for de- 
riving a more general descriptor of image shape. Providing 
the stage 1 stellar sample is a fair representation of the to- 
tal stellar population, and only has modest contamination 
by non-stellar images, it is feasible to use it as a means to 
automatically track the stellar locus for any combination of 
the measured SuperCOSMOS image parameters. 

Automatically deriving both the position and spread 
in the stellar locus as a function of magnitude, for any of 
the image shape parameters is straightforward, even with 
modest amounts of non-stellar object contamination, pro- 
viding robust estimation techniques are used (e.g. Hoaglin, 
Mosteller & Tukey 1983). For any desired image shape de- 
scriptor, the location of the median of the population suf- 
fices to define the locus as a function of magnitude, while the 
median of the absolute deviation from the median (MAD) 
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provides a good measure of the scatter about this locus. 
The MAD estimate of scatter can then be converted to 
the equivalent Gaussian sigma by noting, as before, that 
CGauss = 1-48 X MAD. By this means each of the individ- 
ual image shape descriptors can be renormalised to follow 
a zero-median, unit-variance Gaussian-like N(0,1) distribu- 
tion. 

Another part of the second stage of image classifica- 
tion consists of an iterative determination of the stellar ra- 
dial profile (ie. the stellar point spread function, PSF, for 
the plate) and the determination of a linear instrumental 
magnitude scale. This follows the procedure described by 
Bunclark & Irwin (1983) and makes use of the fact that, 
to a good approximation, all stellar images should have the 
same intrinsic profile shape whatever their magnitude. The 
areal profile measures (Equation ^ form the basis for this 
estimate, using the preset instrumental profile threshold lev- 
els as a first guess for the intensity scale. Figure ^ shows 
the results of a typical profile determination and linearised 
transfer function between isophotal and profile magnitude 
for a survey plate. The resulting stellar PSF can then also 
be used as part of the classification scheme. 

In order to make an optimal judgement about the class 
of an object, a traditional Bayesian classification scheme 
uses knowledge of the conditional and prior probabilities on 
the right hand side of the following equation 



p{s\d) = 



p{d\s)p{s) 



p{d\s)p{s) + p{d\g)p{g) + p{d\n)p{n) 



(5) 



where the probability, or likelihood, of observing the image 
descriptors for stars, galaxies and noise are given by p{d\s), 
p{d\g) andp(d|n) respectively; p{s), p{g) andp(n) are the re- 
spective prior probabilities of finding a star, galajcy or noise 
image; and p{s\d) is the posterior probability of the image 
being a star. In a similar way we can write equations for 
p{g\d) and p{n\d), the posterior probability of the image be- 
ing a galaxy or noise object. 

Unfortunately, the distribution of both galaxy and noise 
image parameters is in general unknown, implying that we 
cannot directly use a Bayesian estimator in this case. The 
problem is further compounded by the fact that the real 
likelihood function for stellar images is also non-trivial to 
estimate. This is caused by the underlying error distribu- 
tion of the parameters being non-Gaussian in the real world 
and also because the image parameters are highly corre- 
lated, with the degree of correlation varying significantly as 
a function of magnitude. 

However, deriving the stellar PSF as part of the internal 
intensity calibration helps to significantly reduce the num- 
ber of free parameters required to accurately describe the 
shape of an object. Each image has (up to) 8 parameters 
measuring the areal profile which can be readily converted 
into the equivalent radial profile. Clearly, images with de- 
rived radial profiles significantly steeper than the stellar PSF 
are more likely to be noise images, while those with derived 
profiles much shallower than the stellar PSF are more likey 
to be galaxies. After some experimentation we found that 
the intensity-normalised slope of the sigma-weighted differ- 
ence between an image radial profile and the stellar PSF 
provided a way of summarising the stellarness of an image 
using all the areal profile information. This measure pre- 
serves both the 'sign' of the deviation and quantifies the 



difference. The normalised-slope measure is then combined 
additively with (the relatively) independent information de- 
rived from the intensity-weighted second moments and peak 
intensity to produce a single number measure. Finally, as 
a function of magnitude, all objects previously classified as 
stellar, are used to convert the stellarness index back to a 
zero-median unit-variance N(0,1) score. This stellarness in- 
dex is our pragmatic equivalent of the likelihood function of 
the data and summaries the probability that the observed 
data arose from a stellar object. 

A typical example of the derived stellarness index for 
a survey plate is shown in Figure ^ with the primary stel- 
lar locus boundaries adopted {+2a for star/galaxy and —3a 
star/noise). As a further refinement this average value of the 
stellarness index as a function of plate position is tabulated 
and used to remove small variations in the zero-point of the 
stellar locus caused by field effects. 

Returning to the idealised Bayesian scheme outlined 
previously, for star/galaxy separation the main quantity of 
interest is 



Pis\d) ^ p{s)p{d\s) 
P{g\d) p{g)p{d\g) 



(6) 



since when this ratio is greater than unity the object is classi- 
fied as a star, and when less than unity as a galaxy. Although 
the underlying distribution, p{d\g), is unknown, we do know 
that both galaxy and noise images have a much wider spread 
of image shape descriptors than stars, hence to a reasonable 
approximation p{d\g) = U{a,b) and p{d\n) = U{c,d); where 
U denotes uniform distribution and the arguments the range 
over which it applies - which in general are a function of 
magnitude. In the absence of prior information the ratio of 
p{s\d) / p{g\d) , if p{g\d) = U{a,b) and one end of the range 
(a, b) lies within the normal stellar distribution, is exactly 
equivalent to specifying a one-sided k-sigma boundary sep- 
arating stars from galaxies. 

The prior information only enters via the ratio of the 
expected numbers of star:to:galaxies at a particular magni- 
tude. For most survey fields (i.e. not the Galactic Plane) 
the total number of stellar images and the total number of 
galaxy images visible on a survey plate are roughly equal. 
As a function of magnitude, however, the stellar population 
varies roughly as n(m) oc lo'^-^^™ while the number of galaxy 
images varies as n(m) oc lo" ''^'" . Using the equality of total 
numbers of star and galaxies near the plate limit to define 
the relative normalising constant, automatically specifies the 
ratio p{s)/p{g) = constant x iQ-o ^'" = iQ-0-2(m-m.,,t) j^j, 
all magnitudes. This prior information ratio can then read- 
ily be incorporated into the previous boundary definition by 
defining a new k'-sigma boundary using the assumed N(0,1) 
distribution of the likelihood function to convert ratios to 
sigmas. 

Although inevitably somewhat adhoc, this scheme does 
allow us to fold in additional information within the frame- 
work of a Bayesian approach (e.g. prior knowledge of 
star:to:galaxy ratios and the knowledge that galaxies and 
noise images have a wide range of image shapes). The ad- 
vantages of decision making directly in k-sigma space are 
that it is straightforward to place realistic constraints on 
the influence of the prior knowledge, i.e. by limiting how 
much the boundary can shift, and it also mitigates against 



© 0000 RAS, MNRAS 000, 000-000 



8 N. C. Hambly et al. 

(a) (b) 




20 40 60 15 30 

Radius (pixels) Profile magnitude 



Figure 3. Results of the iterative determination of the stellar profile for a typical plate: (a) differential profile estimate; (b) initial profile; 
(c) iterated profile; (d) transfer function of isophotal to profile magnitude (see Bunclark & Irwin 1983). 




-C03MAG/1000 - 7.5 

Figure 4. Scatter plot of the profile classification statistic for a typical survey plate. This indicates the residuals (in terms of a zero 
mean and unit standard error quantity) of each image's areal profile set when compared with an average stellar template. The boundaries 
indicate the position of all objects finally classed as stellar. Broadly speaking, objects with steep radial profiles (hard, noisy images) have 
low values, while objects with shallow radial profiles ('fuzzy' images, more often than not galaxies) have high values. An object matching 
the stellar template perfectly has a value of zero. 
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Survey 


No. of 


Gradient of profile scale 




fields 


min 


max 


mean icr 


SERC-J/EJ 


25 


1.133 


1.456 


1.281 ±0.080 


AAO-R/SERC-ER 


19 


1.118 


1.239 


1.178 ±0.028 


SERC-I 


6 


0.947 


1.156 


1.086 ±0.074 


ESO-R 


14 


1.038 


1.179 


1.107 ±0.044 


POSS-I E 


11 


0.956 


1.212 


1.070 ±0.081 


AAO~R on 4415 film 


4 


1.182 


1.297 


1.243 ±0.044 



Table 3. Gradients of the profile magnitude scale for the various 
types of survey plate comprising the SGC dataset. 



the effect of non- Gaussian errors by automatically scaling 
the boundary using a robust sigma estimator. 

2.3 Photometric calibration 

The isophotal instrumental magnitude scale for lAM data 
is non-linear with respect to external apparent magnitude. 
Non-linearities are introduced by, for example, thresholding 
and the limited dynamic range of the measuring machine in 
addition to any non-linearities inherent in the photographic 
characteristic curve with respect to the linear D — log/ 
scale assumed. Additionally, the magnitude scale for galax- 
ies will be, in general, different to that of stars. The rea- 
son for this is that galaxies are generally more extended 
and of lower surface brightness than stars, and as such have 
integrated magnitudes that are less susceptible to any dy- 
namic range limit during measurement. Moreover, the lin- 
earised profile magnitude for stars is easier to calibrate than 
the isophotal scale when external calibration data are lim- 
ited since it is better behaved. Note that the profile mag- 
nitude for a non-stellar object is not useful since such ob- 
jects have, of course, non-stellar profiles. For these reasons, 
the stellar/non-stellar magnitude calibrations for the sur- 
vey data are treated separately. The calibration procedure 
adopted was as follows: 

2. 3. 1 Gradient of the instrumental stellar profile 
magnitude scale 

For each of the survey colours (e.g. SERC-J, ESO-R) the 
gradient of the faintest 7 mag of the profile magnitude scale 
was measured against CCD data from Boyle, Shanks & 
Croom (1995) and Croom et al. (1999). Throughout this pa- 
per, where any comparisons have been done between photo- 
graphic and photoelectric photometry, colour equations from 
Blair & Gilmore (1982), BesseU (1986) and Evans (1989) 
have been used to naturalise the magnitudes to the photo- 
graphic Bj/R63p/R59f/E2224/Iivn Systems. The results of 
the measurements are shown in Table ^ where the num- 
ber of plate gradients and a standard error estimate are 
also given. We found that the RMS residuals of the linear 
fits between the profile and standard magnitudes were in 
the range 0.1™ to 0.2™, in line with expectations for photo- 
graphic photometry of fainter stars and also demonstrating 
that the profile magnitude scale has no increased scatter 
over that present in the raw isophotal scale (e.g. Bunclark 
& Irwin 1983). The mean gradients in Tableware reasonably 
close to unity with small standard errors. This implies that 



the assumed calibration used for transmission to intensity 
was a reasonably good one. Furthermore, the small scatter 
in the gradients means that rough calibration to the plate 
limit is possible on a field-by-field basis using a small num- 
ber of relatively bright standards in each field to define a 
bright stellar calibration curve that can be extrapolated to 
the plate limits. Figure ^ shows a typical example of a deep 
photometric sequence plotted against the linearised profile 
magnitude scale, demonstrating how closely linear the pro- 
file scale is over many magnitudes above the plate limit. 
Contrast this with the same plotted for the isophotal scale, 
which is non-linear at the 0.5 mag level. 

2.3.2 Rough profile magnitude calibration for stars 

In order to roughly calibrate the stellar profile magnitude 
scale of each field with respect to external standards, a com- 
pilation of BV photometry over the whole sky was made by 
combining the Tycho Catalogue (ESA 1997) and the Guide 
Star Photometric Catalogue I (Lasker et al. 1988) . This pro- 
vided two colour photometry down to V ~ 15 in every sur- 
vey field. Standard magnitudes in the RI bands have been 
estimated from relationships between (B-V) versus (V-R) 
and (V-I) derived from the multicolour standard star data 
of Landolt (1992). For each field, a calibration curve of the 
profile instrumental magnitude versus naturalised standard 
magnitude was derived. Figure ^ shows a typical example 
for the survey data. The solid lines are least-squares fits to 
the data, with the added linear constraints that at a fixed 
point 3 mag fainter than the the faintest calibrating stan- 
dard, thegradient is set at the appropriate mean value given 
in Table N while the second derivative of the fitting function 
is set to zero. In this way, the calibrating function can be 
linearly extrapolated from this point to the plate limit, and 
the calibrating function has no discontinuities in itself or 
its first derivative. This somewhat complicated procedure 
is an attempt to realistically model the shape of the 'lin- 
earised' stellar profile magnitude scale, which is indeed lin- 
ear over its faintest 7 mag range (e.g. Figure ^), but which 
departs from linearity brighter than this due to the funda- 
mental dynamic range limits of the machine as dictated by 
diffraction-limited performance in the imaging optics and 
the transmission digitisation range, along with the dynamic 
range limit of the photographs themselves. 

The fact that the calibrations are extrapolated for the 
fainter magnitude range coupled with the paucity of external 
calibrating stars on the linear part of the profile scale results 
in small (10% to 20%) errors in the zeropoint of the faint 
magnitude calibration of each field. This problem is dealt 
with in the next Section. 



2.3.3 Global zeropoint definition 

All the Schmidt surveys have varying degrees of overlap 
between adjacent fields. For the POSS-I E and ESO-R 
surveys, this overlap can be very small (or non-existent 
in the case of a few adjacent ESO-R fields), but for the 
UK Schmidt surveys the field centres were chosen to give 
~ 0.5° overlap strips for all fields (this system of field 
centres, refiected into the northern hemisphere, has been 
adopted for the POSS-II survey). These overlap regions can 
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Figure 5. Plots of naturalised photoelectric magnitudes verus (a) profile magnitude and (c) isophotal magnitude for R band data in 
survey field 413. Panels (b) and (d) show the non-linear components after linear least-squares regression fits. The profile scale is closely 
linear over ~ 8 mags. The isophotal scale is non-linear at the level of > 0.5 mag; furthermore, such non-linearities are not repeatable 
from field to field. 



(a) (b) 
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Figure 6. Calibration curve of the linearised stellar profile magnitude scale versus external photoelectric photometry naturalised to the 
photographic system. The solid line in (a) is a least-squares polynomial fit constrained to have the required gradient at the faint end. In 
(b) we show the residuals of the data about the fit. 
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be used to define a set of individual field photometric zero- 
points such that the mean difference over all stars between 
the stellar magnitudes from the two measurements available 
per star in any given overlap region is zero. Consider any 
two adjacent fields m and n. The mean (systematic) differ- 
ence between the stellar magnitudes in the overlap region 
is A„rn = Zn — Zm whcrc Zn and Zm are the unknown ze- 
ropoints. A mosaic of many fields gives a (singular) set of 
equations in the A and z values. Such a set may be solved 
by singular value decomposition to yield minimal zeropoints 
that solve the set. However, we have used faint CCD pho- 
tometry to externally fix as many zeropoints as possible 
over the survey region. This makes the equation set non- 
singular and yields a unique vector of zeropoints that are 
tied to an external system. This procedure is similar to that 
adopted by Maddox, Efstathiou & Sutherland (1990) for the 
APM Galaxy Survey except that we have not solved for 
the field-to-field differences in the gradients of the photo- 
metric scales. Also, our sources of external CCD photome- 
try included data from the Guide Star Photometric Cata- 
logue II (Postman et al. 1997), Boyle et al. (1995), Groom 
et al. (1999), Cunow et al. (1997) and references therein, 
unpublished data used in the calibration of the Edinburgh- 
Durham Southern Galaxy Catalogue (e.g. Nichol 1993 and 
references therein) and Stobie, Sagar & Gilmore (1985) in 
addition to that published in Maddox et al. (1990). At the 
time of writing, approximately 50% of all southern hemi- 
sphere fields have external CCD data defining their photo- 
metric zeropoint. 

2.3.4 Correction of systematic errors in stellar colours as 
a function of magnitude and plate position 

The photometric calibration of photographic plates based 
on calibrating stars necessarily assumes that there exists a 
unique calibration curve that is applicable over the entire 
plate. In general, this is not the case. For example, graphs of 
external photoelectric magnitude versus internal linearised 
profile magnitude, as a function of plate position, show sig- 
nificant changes in gradient. One possible cause of this is 
that the response of the photographic emulsion is not uni- 
form over the plate - i.e. the characteristic curve of the 
emulsion is not constant. Whatever the cause, the results 
are the same: systematic errors in photometry as a function 
of magnitude and position occur, and those errors manifest 
themselves most visibly when magnitudes are combined to 
generate colour indices. Moreover, we have assumed a con- 
stant value of the gradient of the profile magnitude scale in 
any given passband, and as Table ^ shows there is a signifi- 
cant dispersion about the mean values adopted which adds 
to any systematic effects within a given plate. For example. 
Figure ^ shows an I versus (R-I) colour-magnitude diagram 
for survey field 758. There is apparently a trend for brighter 
stars to get become bluer in this diagram, along with a signif- 
icant increase in scatter about the modal colour for brighter 
magnitude ranges. These effects are both due to system- 
atic errors in photometry as a function of magnitude and 
position. Furthermore, Figure ^ shows a stellar (J-R) ver- 
sus (R-I) two-colour diagram for a mosaic of 9 fields at the 
South Galactic Pole for the magnitude range 17 < Bj < 18. 
While the general trend of red objects being red in both 
colours is apparent, it is also true that stars having interme- 




(R-I) 

Figure 7. Stellar colour-magnitude diagram (I versus R-I) for 
survey field 758 before correction of systematic colour errors as a 
function of magnitude and position. There is no strong redden- 
ing in this high latitude field, so the large colour change with 
magnitude has to be due to systematic calibration errors. 



diate colours do not show a good correlation between (J-R) 
and (R-I). The scatter in Figure |^ is rather large, and masks 
astrophysically useful information. 

These problems have been documented in the past 
(e.g. Hambly et al. 1995; Goldschmidt & Miller 1998 and 
references therein). In the abscence of large-scale external 
calibration sequences, there is no option other than to use 
the stellar distributions themselves (in colour, as a function 
of magnitude and position) to measure the size of these ef- 
fects and to calibrate them out. Such corrections require 
that one passband is chosen as the standard with respect 
to which all other passbands are corrected. Here, we have 
chosen the J plates for this purpose because the most widely 
available external photometry is in the B band and we ex- 
pect that the rough calibrations (Section 2.3.2) will be the 



most accurate in this passband. The correction procedure is 
straightforward, and is applied to each set of R and I field 
data individually. First, shifts in colour as a function of posi- 
tion are measured by cross-correlating smoothed histograms 
of number-colour counts as a function of magnitude and 
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I 




(Bj-R) 

Figure 8. Stellar two-colour diagram (Bj-R, versus R-I in the 
magnitude range 17 < Bj < 18) for 9 contigous survey fields 
around the South Galactic Pole before correction of systematic 
colour errors as a function of magnitude and position. 



Survey 



AAO-R/SERC-ER 
SERC-I 
ESO-R 



Colour Position of 
blue edge 



(Bj-R) 
(Bj-I) 
(Bj-R) 



0.789 
1.204 
0.774 



Table 4. Values of the lOth-percentile blue 'edge' used in fixing 
the survey colours (see text). 



plate position in coarse bins of 1 mag and 2 cm respectively. 
The resulting data cubes of shifts are then padded, resam- 
pled onto a 3x finer grid and linearly filtered and smoothed 
to yield final shift values to be applied to the data. The 
final stage of the correction procedure consists of forming 
number-magnitude histograms as a function of magnitude 
only, in 0.1 mag bins, and finding the 10th percentile blue 
'edge' of the distributions. Hence, a shift of this value with 
respect to a global value (measured as the median 10th per- 
centil blue edge of faint stars over all the fields in the sur- 
vey) is measured and can be applied to the data to shift and 
straighten the blue edge to an internally consistent position. 
Table ^ shows the values measured over the whole survey for 
anchoring the colours in this way. Figures ^ and |l^ show the 
results of this procedure for direct comparison with Figures ^ 
and 1^. There is a significant improvement in the morphology 
and reduction in scatter in these diagrams after correction. 

Of course, it should be noted that this procedure can 
potentially remove real astronomical features from colour- 
magnitude and two-colour diagrams. For example, differen- 
tial reddening across a field would tend to be calibrated out 
by the above procedure. For this reason, the database access 
software (Paper I) allows the user the option of turning off 




(R-I) 

Figure 9. Stellar colour-magnitude diagram (I versus R-I) for 
survey field 758 after correction of systematic colour errors as a 
function of magnitude and position. 



these corrections at access time so that uncorrected colours 
are available if required. 

2.3.5 Galaxy (isophotal) magnitude calibration 

As mentioned previously, the galaxy magnitude calibration 
must be treated separately to that of stellar objects. The 
isophotal magnitude scale is tied to that of stellar objects 
using the faintest 2.5™ range of objects where the isophotal 
scale for stars and galaxies is the same since at these magni- 
tudes, the majority of galaxies are unresolved on the plates 
and have the same profile shapes. In order to apply a simple 
linear fit between isophotal and profile magnitudes, however, 
the isophotal scale must first be corrected for thresholding 
non-linearity (the profile scale is, of course, linear over much 
more than the faintest 2.5™ range). 

The thresholding correction to the isophotal scale was 
modelled assuming a Gaussian profile. 



I{r) = /pEAKexp(-r^/2a^ 



(7) 



from which integration yields /tot ~ 27ra^JpEAK where 
o-=FWHM of the profile. If thresholding truncates the in- 
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(Bj-R) 

Figure 10. Stellar two— colour diagram (Bj-R versus R— I in the 
magnitude range 17 < Bj < 18) for 9 contigous survey fields 
around the South Galactic Pole after correction of systematic 
colour errors as a function of magnitude and position. 

tegrated brightness at a radius rt, then it is easy to show 
that 

100/pEAK 



n = 2o-^ log, 



(8) 



I /sky X PCUT J ' 

whe re /sk y and PCUT are defined by Equation ^ in Sec- 
We also have that the difference between the total 



2.1.4 



tion 

and isophotal integrated intensities of an object is given by 

p oc 

/tot - /isophotal = 27r/pEAK / exp(-r^/2o-^)rdr 

J rt 

27ra^/sKY X PCUT 
100 

and defining an isophotal correction e such that 
mcORR = miSOPHOTAL — 2.51ogj^(,(l + e), 
it is straightforward to show that 



e= 1 



/sky X PCUT -(-^ ^ 



(9) 
(10) 

(11) 



Hence, a calibration of mcoRR versus the calibrated pro- 
file magnitude for the same image is possible, and is fur- 
thermore applicable to the full threshold corrected isopho- 
tal scale. Note that e is not a correction to 'total' galaxy 
magnitudes (e.g. Young et al. 1998 and references therein) 
since such a correction is dependent on galaxy profile shape. 
Note also that we calibrate mcoRR + 2.5 logj^Q /sky to pro- 
vide a first-order correction for vignetting and differential 
desensitisation as a function of plate position (e.g. Maddox 
et al. 1990). An example calibration of mcoRR versus cali- 
brated profile magnitude is shown in Figure ^ In this way 
a calibration for non-stellar objects is defined in every field, 
this calibration being tied to the same external zeropoints 
as the stellar magnitude scale. 

It is worth emphasising that the calibrations set up for 



the survey photometry take the form of coefficients values 
that are applied to raw measurements as and when the data 
are accessed (see Paper I). This allows recalibration at any 
point in the future when better methods and/or better exter- 
nal calibration data become available. The raw, uncalibrated 
and uncorrected isophotal and profile instrumental magni- 
tudes are also available, along with all morphological infor- 
mation produced during pixel analysis. This will allow, for 
example, investigation and implementation of more sophisti- 
cated galaxy calibration techniques in the future (e.g. Young 
et al. 1998 and references therein). 



3 RESULTS AND DISCUSSION 

When assessing the accuracy of the measurement, classifi- 
cation and calibration schemes detailed in the previous Sec- 
tion, it is important to distinguish between the various types 
of error that it is possible to measure and quote. This is par- 
ticularly important when comparing the SSS characteristics 
against those of other survey programmes (e.g. Paper I). 
Here, we have assessed the internal consistency of the sur- 
vey data by comparison between data for the same objects 
from overlap regions (i.e. where two independent measure- 
ments of the same quantity are available). Under the as- 
sumption that the two overlapping plates have the same 
characteristics of image quality and signal-to-noise ratio, 
this also provides estimates of the random errors (dividing 
by .^2, where appropriate, for a single measurement). Note 
that similar comparisons in overlap regions other than those 
specifically used here yield similar results provided they have 
Galactic latitudes |6| > 30°. As it will be shown in the next 
Sections, the random errors in single colour photometry are 
dominated by position and magnitude dependent systematic 
errors for brighter magnitude ranges, and it is therefore use- 
ful to quote an apparent 'standard' error in the presence of 
such effects, and also a relative error when comparing pho- 
tometry within restricted magnitude and position ranges. 
The latter thus refiects the true, underlying random errors 
in the measurement process and is useful when the data 
are being used in isolation (for example, inter-comparison 
of object magnitudes) while the former is more appropriate 
when measurements are taken at face-value and compared 
with external data. Note however that the calibration pro- 
cedure removes systematic errors in colour as a function of 
magnitu de an d position and across the survey as a whole 
(Section 2.3.4 ) so that the SSS object colours (e.g. Bj-R, 
R-I) are much more accurate than the external errors on 
individual passband magnitudes. This is illustrated later. 

Of course, systematic errors can only be truly assessed 
with respect to external data that can be regarded as 'gold' 
standard, i.e. free from systematic error and having negligi- 
ble random error, and we also describe comparisons against 
external CCD data that are of sufficient angular resolution 
and signal-to-noise that they can be regarded as error-free 
for these purposes. 



3.1 Internal consistency and relative accuracy 
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Figure 11. Calibration curve of the mcoRR (see text) for non-stellar objects versus calibrated profile magnitude for faint, unresolved 
images. 




Figure 12. Classification scores when comparing data from two overlapping plates for (a) SERC-J/EJ, (b) AAO-R/SERC-R, (c) SERC— 
I and (d) ESO-R surveys. Solid circles are for stars while open circles are for galaxies; solid and dashed lines indicate the percentage 
accuracy of a single classification for stars and galaxies respectively (see text). 



© 0000 RAS, MNRAS 000, 000-000 



SuperCOSMOS Sky Survey II: Image detection & parameterisation etc. 15 



3.1.1 Image classification 

In order to assess the consistency of the image classification, 
we compared data from each of the four survey types J,R,I 
and ESO-R in the large (~ 4°) overlap region between field 
numbers 292 and 241. In Figure p^(a) to (d) we show plots 
of the percentage agreement between the two independent 
classifications, as a funcion of magnitude, for stars (solid 
circles) and galaxies (open circles). These were analysed for 
the underlying correct classification for a single measurement 
using the following argument. Assume that there are just the 
two image classes, stellar and galaxian (the fraction of noise 
and unclassifiable images on any plate is always ^ 1%). 
If we denote the probability that a real star is classified 
as such in any given magnitude range by pi(s|s) and the 
probability that a real galaxy is classed as such by Pi{g\g) 
along with probabilities for incorrect classification hy pi{s\g) 
and pi{g\s) (where the subscript refers to plate 1 and similar 
definitions are made for the overlapping plate 2), we have 
that for plate 1 for example 



pi{s\s) + pi{g\s) = 1 

piiglg) + pi{s\g) = 1 



(12) 
(13) 



and similarly for plate 2. Between the two independent mea- 
surements, the probability of agreement for stellar class (for 
example) is equal to the sum of the probability that both 
measurements class a real star as such and the probability 
that both measurements misclassify a galaxy as a star; 

p(agree s) = pi{s\s)p2{s\.s) + pi{s\g)p2{.s\g); (14) 

likewise for agreement on galaxian classification: 



p(agree g) = Pi{g\g)p2{g\g) + Pi{g\s)p2{g\s) 



(15) 



Assume now that the noise parameters and general im- 
age quality of the plates are the same, i.e. write p{s\s) = 
P\[s\s) = p2{s\s) etc. and since we are not interested in the 
cross terms in Equations Fi] and |l^ they can be eliminated 
using Equations [13 and |l3pto yield 



p(agree s) = p{s\sf + {1 - p{g\g)}^ , 
p{agTee g) = p{g\gf + {1 - p{s\s)}^ , 



(16) 
(17) 



which is a set of two non-linear equations in the two un- 
knowns p{s\s) and p{g\g). These are soluble using a simple 
iterative scheme, and in Figure ^(a) to (d) we plot the de- 
rived functions p{s\s) (solid lines) and p{g\g) (dashed lines). 

From Figure [13, the image classification scheme is 
clearly working well, with stellar classification being inter- 
nally repeatable/reliable at the > 95% level for magnitudes 
Bj < 21, R < 19 (or R < 18.5 for ESO-R) and I < 17.5. 
For galaxies, classification is reliable at the > 90% level in 
the ranges 17.5 < Bj < 21.5, 15 < R < 19 but is clearly not 
so good for the I plates. This is possibly due to the gener- 
ally poorer signal-to-noise of I band Schmidt plates. Note 
that reliable image classification for all I images is of course 
available from the Bj data unless an object is severely red- 
dened or intrinsically very red. It is important to emphasise 
that the only true test of the image classification reliability 
is against external data. This is because the above analy- 
sis cannot take into account any systematic effects (for ex- 
ample, classification of large numbers of faint, unresolved 
galaxies as stellar objects). Such a rigorous test is detailed 



in Section |3.2.1 . On the other hand, this internal test illus- 
trates that the classification scheme is not severely limited 
by random errors at magnitudes as faint as ~ 1.5™ above 
the respective plate limits. 



3.1.2 Photometry (single passband) 

Using the same overlap region as in the previous Section, 
we have compared the photometry to assess the internal ac- 
curacy of the photometric measurements and calibrations. 
Tables ^ to ^ give detailed accuracy figures as a function 
of magnitude for the J,R,I and ESO-R surveys respectively. 
In each Table, the figures for stars and galaxies are given 
separately since galaxy photometry will be less accurate at 
a given magnitude (e.g. Irwin 1985). The columns in each 
Table for each image type are the number of objects used in 
obtaining the measurements; the raw RMS (including any 
systematic effects); the median global zeropoint error; the 
RMS when this global zeropoint is removed; and finally the 
RMS when a position-dependent zeropoint is removed. Ze- 
ropoints as a function of position were determined from the 
data by taking median values over a scale size of ~ 2 cm 
(~ 20 arcmin) on the plates. 

The following points should be noted from Tables ^ to ^ 

• There are no systematic zeropoints at the faint end - 
this is of course a result of the plate matching procedure 



detailed in Section 2.3.3 



• There are increasingly large systematic errors for in- 
creasingly bright stellar magnitude ranges. 

• Galaxy magnitudes are less susceptible to systematic 
effects than are stellar magnitudes. 

• The relattve precision of stellar magnitudes (i.e. within 
restricted position and magnitude ranges) is 5% or better 
when 3 mag or more above the respective plate limits. 

• Relative galaxy magnitudes are, in general, worse than 
those of stars in a given magnitude range, as expected. 

The systematic errors for bright stellar images are partly due 
to positional effects and partly due to the assumption of a 
mean gradient for the linear part of the profile magnitude 
scale. Since the gradient of the isophotal scale is also tied to 
this mean gradient, then systematic effects are also increas- 
ingly apparent in increasingly bright galaxian magnitude 
ranges due to the linear extrapolation fr om th e faint lev- 
els at which the zeropoint is fixed. Section 3.2.2| shows some 
of these effects when comparing SGC photometry against 
an independent, external dataset. 

Figure [l3|(a) to (j) illustrates some of these effects 
graphically, where we have plotted stellar J survey data in 
five magnitude ranges of 2 mag size in the range 13 < J < 23. 
The left-hand panels show histograms of raw magnitude dif- 
ferences, while the right-hand panels show the underlying 
distributions of random errors left over after positionally- 
dependent systematic errors have been removed. Once again, 
the increase in random error, and decrease in systematic er- 
ror, with decreasing brightness are apparent. 



3.1.3 Photometry (colours) 

Figure |l4| shows the consistency of our survey colours J- 
R (with R from both UK and ESO Schmidts) and R-I in 
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Figure 13. Histograms of differences between J survey photometry measured in the overlap region of fields 292 and 241 for the magnitude 
ranges (a,b) 13 < Bj < 15; (c,d) 15 < Bj < 17; (e,f) 17 < Bj < 19; (g,h) 19 < Bj < 21; and (i,j) 21 < Bj < 23. The left-hand panels 
are raw differences; the right-hand panels show distributions after subtraction of positionally dependent systematic errors. 
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0.14 
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0.06 




17.0 < Bj 


< 18.0 


4305 


0.42 


-0.40 
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0.05 
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0.15 
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Table 5. Internal systematic and random errors measured from the overlapping plate pair SERC-J fields 292 and 241. 
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0.04 


197 


0.06 


-0.05 
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0.05 
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0.09 


0.09 


0.06 
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0.07 
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0.12 


0.11 
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0.14 


0.02 


0.14 
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0.18 


0.07 
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0.11 
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Table 6. Internal systematic and random errors measured from the overlapping plate pair AAO-R fields 292 and 241. 



the magnitude range 16 < Bj < 17. Once again, because we 
have calibrated out the systematic errors in colour as a func- 
tion of position and magnitude, a,nd a lso set the colour ze- 
ropoint in every field (see Section ^.3.4| ) the colours are con- 
sistent at a level dictated by random errors alone. The RMS 
values about the y = x lines in Figure |l^ are ~ 0.1 mag, 
indicating single measurement random errors in the colours 
at the level of ~ 0.07 mag in this magnitude range. While 
this is strictly speaking a test of internal consistency, it also 
illus trates the true external errors in the colours (cf. Sec- 
tion 3.2.3) owing to the way in which any systematic errors 
have been eliminated. 



3.2 External tests of accuracy and reliability 

3.2.1 Image classification and completeness 

We have assessed the accuracy of image classification and 
detection completeness of SGC data using 4m prime focus 
data from Carter (1980) and CCD data from Metcalfe et 
al. (1991 - hereafter MSFJ91). Those data are demonstra- 
bly deeper than that from the Schmidt plates and although 
they are only slightly better in terms of angular resolu- 
tion (e.g. 1.5 arcsec for the MSFJ91 CCD images as op- 
posed to ~ 2 arcsec for the SGC data) their higher signal- 



to-noise ratio at a given magnitude allows thresholding at 
lower isophotes and therefore more reliable image classifi- 
cation. In Table ^ we give figures for comparison against 
the galaxy sample of Carter (1980). This comparison shows 
completeness levels of 100% down to Bj ~ 21 with > 90% 
reliability of correct classification down to Bj ~ 20. This 
is in good agreement with other estimates of the reliability 
of Schmidt survey galaxy catalogues (e.g. Caretta, Maia & 
Willmer 2000) . Tables and ^ give similar figures for com- 
parison of sets of stars and galaxies from MSFJ91 against 
SGC J and R data. Field numbers 823 to 826 and 891, 893 
and 894 were used in these comparisons (fields 4, 13, 10, 14, 
2, 3 and 9 respectively in Table 1 of MSFJ91). 

The reliability figures in Tables ^ and |lo| clearly do 
not agree with the internally assessed classification accu- 
racy estimates displayed in Figures |l^(a) and (b). The rea- 
son for this is that fainter than m ~ 20, the typical size of 
detectable galaxian images is comparable to the seeing on 
the photographic plate material, and the image classifica- 
tion scheme inevitably classifies unresolved sources as stars. 
In Figures |l|(a) to (b) the accuracy of the classification is 
overestimated since there is a systematic trend for objects 
to be classed as stellar regardless of their actual type. The 
false level of agreement on stellar classification potentially 
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Table 7. Internal systematic and random 


errors 
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overlapping plate pair 


SERC-I fields 292 
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Table 8. Internal systematic and random errors measured from the overlapping plate pair ESO-R, fields 292 and 241. 



leads to a correspondingly false estimate of the underlying 
classification accuracy for both image types; however it is 
clear from the internal reliability that some very faint, non- 
stellar sources must be consistenly classified as such when it 
is possible to detect their non-stellar nature from the pho- 
tographic data. 

It is interesting to note that the reliability estimates 
from Table ^ were made from a field which presents a small 
zenith distance at the UK Schmidt Telescope, while Ta- 
bles |l| and are based on data at the celestial equator. 
This probably accounts for their poorer image classification 
reliability. 

3.2.2 Systematic photometric errors as a function of 
magnitude 

As illustrated in Figure while the survey field photo- 
metric zeropoints are defined globally using faint objects in 
overlap regions along with a network of CCD standards, at 
increasingly bright magnitudes systematic errors become ap- 
parent. This is due to the assumption of a single global gra- 
dient for the characteristic curve for the plates in any given 
colour. Furthermore, it is to be expected that systematic 
errors will be dependent on plate position as well as magni- 
tude, since the photographic response will not be perfectly 
uniform within a single plate. This is illustrated in more de- 
tail in Figure ^5|(a) to (d), where we have compared SGC R 
photometry against the SDSS calibration region A (Stone, 



Pier & Monet 1999). These data are in a slightly different 
passband (SLOAN r as oppose to R59F) but the difference 
is negligible at the accuracy we require, and the random er- 
rors on the CCD data are negligible compared to the photo- 
graphic errors. The CCD data encompass ~ 3.5° x 7.5° and 
are therefore ideal for showing position dependent systemat- 
ics over areas that traverse photographic plate boundaries. 

In Figure 0(a) to (d) we have divided the data in 10 < 
R < 18 into four 2 mag ranges; the vertical error vectors are 
scaled such that 1™ = 1°. Only stars have been used in this 
analysis. The following points should be noted: 

• In the faintest magnitude range (R > 16) there are no 
significant zeropoint errors within a field or in traversing to 
an adjacent field. 

• There are global systematic zeropoint errors within in- 
dividual fields at the level of several tenths of a magnitude 
for R < 16; these are, in general, different for different fields. 

• At bright magnitudes, the systematic errors change 
within a single field as a function of plate position. These 
variations are once more at the level of several tenths of a 
magnitude. 



3.2.3 Effective external errors in photometry 

Finally, we have checked the external errors (i.e. those re- 
sulting in a straightforward comparison between an external 
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Figure 14. Comparisons of survey colours in the overlap region of fields 241 and 292 in the magnitude range 16 < Bj < 17: (a) Bj — 
(ESO— R); (b) Bj —R (UKR) and (c) R— I. The dashed lines shows y = x. The RMS dispersion is ~ 0.1 mag indicating single-measurement 
errors of ~ 0.07 mag in any colour in this magnitude range. 
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Table 9. Comparison of the galaxy sample of Carter (1980) against the corresponding SGC J data. 



dataset and our survey data, with no regard as to the sys- 
tematic nature of those errors) of our BRI photometry in the 
equatorial zone by combining data from Landolt (1992) and 
Boyle et al. (1995). Note that although the Boyle et al. data 
have bee n used to measur e gradients and tie down zeropoints 
(Sections 2.3.1 and 2.3.3), they have not been used to derive 
detailed individual calibration curves on a plate-by-plate 
basis. Galaxy photometric precision has been assessed using 
the Metcalfe et al. (1991) data along with data from Cunow 
et al. (1997 and references therein). Table |l^ shows the re- 
sults. The main point to note from this comparison is that 
brighter than m ~ 15, there can be large errors in the pho- 
tometry (their origin is discussed previously). Fainter than 
this, in general the stellar photometry is typically accurate 
to a ~ 0.3 (at best, cr ~ 0.1) until the respective plate limits 
are approached, whereupon emulsion noise begins to dom- 
inate and the random errors rise again (this is particularly 
noticeable for the lower s/n SERC-I and POSS-E plates). 
It is also clear that galaxy photometry is, in general, less ac- 
curate than stellar photometry in a given magnitude range 
as expected. 

Figure |l^ illustrates external checks on the colour in- 
dices Bj-R and Bj-I from the same data, in the natural sys- 
tems of the photographic emulsion/filter passbands. These 
indicate the external accuracy of the SSS colours. Despite 
the presence of large systematic errors on individual mag- 
nitudes, the colours are accurate to the levels indicated 
in the previous internal consistency check: at Bj ~ 16.5, 
the colours are accurate to the level ctb-r.r-i ~ 0.07; at 



Bj ~ 20.0 (typical of the external comparison stars in Fig- 
ure |l^) the errors rise to ctb-r.r-i ~ 0.16. Moreover, there 
is no evidence of any systematic errors in the colours greater 
than ~ 0.1 mag, vindicating the choice of colour zeropoints 
given in Table m 



4 CONCLUSION 

We have presented a detailed description of the image detec- 
tion, parameterisation, classification and photometric prop- 
erties of the SuperCOSMOS Sky Survey]. With reference 
to examples from the first release of data, the South Galac- 
tic Cap (SGC) survey, we have demonstrated the typical 
completeness, classification reliability and photometric er- 
rors (both random and systematic). For a summary of these 
results along with similar comparisons for other large-scale 
survey programmes and a guide to using the SSS database, 
the reader is referred to Paper I. 
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SuperCOSMOS Sky Survey II: Image detection & parameterisation etc. 
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Table 12. Global external photometric accuracy of SGC data as a function of magnitude in the equatorial region for stars and gal 
(Ns and Nq respectively). 
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Figure 16. Comparisons of survey colours with those from Boyle 
et al. (1995): (a) Bj - R; (b) Bj - I. The dashed Unes shows 
y = X. The RMS dispersion is ~ 0.16 mag indicating the single- 
measurement errors in any colour at these magnitudes (typically 
Bj ~ 20). 
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Figure 15. Positional dependent systematic errors between SGC 
R data and the CCD photometry of Stone et al. (1999) in SDSS 
calibration region 'A'. Data are binned in ~ 2 cm bins and 
smoothed to show the global trends in the systematics; in plots (a) 
to (d) we show four 2.0 mag ranges between 10 < R < 18. The 
vertical bars are scaled such that 1° = 1 mag. 
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